Picture for Weihua Chen

Weihua Chen

Towards 3D-Aware Video Diffusion Models: Render-Free Human Motion Control with Mesh Tokenization

Add code
Jun 01, 2026
Viaarxiv icon

Lumos-Nexus: Efficient Frequency Bridging with Homogeneous Latent Space for Video Unified Models

Add code
May 29, 2026
Viaarxiv icon

LumosX: Relate Any Identities with Their Attributes for Personalized Video Generation

Add code
Mar 20, 2026
Viaarxiv icon

Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model

Add code
Oct 21, 2025
Figure 1 for Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model
Figure 2 for Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model
Figure 3 for Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model
Figure 4 for Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model
Viaarxiv icon

The best performance in the CARE 2025 -- Liver Task (LiSeg-Contrast): Contrast-Aware Semi-Supervised Segmentation with Domain Generalization and Test-Time Adaptation

Add code
Oct 05, 2025
Figure 1 for The best performance in the CARE 2025 -- Liver Task (LiSeg-Contrast): Contrast-Aware Semi-Supervised Segmentation with Domain Generalization and Test-Time Adaptation
Figure 2 for The best performance in the CARE 2025 -- Liver Task (LiSeg-Contrast): Contrast-Aware Semi-Supervised Segmentation with Domain Generalization and Test-Time Adaptation
Figure 3 for The best performance in the CARE 2025 -- Liver Task (LiSeg-Contrast): Contrast-Aware Semi-Supervised Segmentation with Domain Generalization and Test-Time Adaptation
Figure 4 for The best performance in the CARE 2025 -- Liver Task (LiSeg-Contrast): Contrast-Aware Semi-Supervised Segmentation with Domain Generalization and Test-Time Adaptation
Viaarxiv icon

RealisMotion: Decomposed Human Motion Control and Video Generation in the World Space

Add code
Aug 12, 2025
Viaarxiv icon

Unleashing the Potential of Consistency Learning for Detecting and Grounding Multi-Modal Media Manipulation

Add code
Jun 06, 2025
Figure 1 for Unleashing the Potential of Consistency Learning for Detecting and Grounding Multi-Modal Media Manipulation
Figure 2 for Unleashing the Potential of Consistency Learning for Detecting and Grounding Multi-Modal Media Manipulation
Figure 3 for Unleashing the Potential of Consistency Learning for Detecting and Grounding Multi-Modal Media Manipulation
Figure 4 for Unleashing the Potential of Consistency Learning for Detecting and Grounding Multi-Modal Media Manipulation
Viaarxiv icon

FPSAttention: Training-Aware FP8 and Sparsity Co-Design for Fast Video Diffusion

Add code
Jun 06, 2025
Viaarxiv icon

On Denoising Walking Videos for Gait Recognition

Add code
May 24, 2025
Viaarxiv icon

RealisDance-DiT: Simple yet Strong Baseline towards Controllable Character Animation in the Wild

Add code
Apr 21, 2025
Figure 1 for RealisDance-DiT: Simple yet Strong Baseline towards Controllable Character Animation in the Wild
Figure 2 for RealisDance-DiT: Simple yet Strong Baseline towards Controllable Character Animation in the Wild
Figure 3 for RealisDance-DiT: Simple yet Strong Baseline towards Controllable Character Animation in the Wild
Figure 4 for RealisDance-DiT: Simple yet Strong Baseline towards Controllable Character Animation in the Wild
Viaarxiv icon